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An Improved Upper Bound for the Most 
Informative Boolean Function Conjecture 
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Abstract 

Suppose X is a uniformly distributed n-dimensional binary vector 
and Y is obtained by passing X through a binary symmetric channel 
with crossover probability a. A recent conjecture by Courtade and 
Kumar postulates that /(/(X); Y) < l—h{a) for any Boolean function 
/. So far, the best known upper bound was I{f{X);Y) < (1 — 2a)^. 
In this paper, we derive a new upper bound that holds for all balanced 
functions, and improves upon the best known bound for all | < a < p 


1 Introduction 

Let X be an n-dimensional binary vector nniformly distribnted over {0,1}”, 
and Y be the ontpnt of passing each component of X throngh a binary 
symmetric channel with crossover probability a < 1/2. The following was 
recently conjectnred by Courtade and Kumar PQ. 

Conjecture 1 For any Boolean function f : {0, !}"■ i— )■ {0,1} it holds that 

J(/(X);V)<l-h(a), (1) 
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supported by an ERC grant no. 639573, a CIO grant no. 631983, and an ISP grant no. 
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where I{f{X); Y) is the mutual information between f{X) andY, and h{p) = 
—plogp — {1 — p) log(l — p) is the binary entropy function^ 

For a dictatorship function, f{X) = Xi the conjectured upper bound ([T]) 
is attained with equality. Therefore, the conjecture can be interpreted as 
postulating that dictatorship is the most “informative” Boolean function, 
i.e., it achieves the maximal I{f{X)-,Y). 

So far, the best known bound that holds universally for all Boolean func¬ 
tions is 


/(/(A');y)<(l-2a)l (2) 

This bound can be established through various techniques, including an ap¬ 
plication of Mrs. Gerber’s Lemma M, the strong data-processing inequal¬ 
ity EE] and standard Fourier analysis as described below. 

In this paper, we derive an upper bound on /(/(X); Y) that holds for all 
balanced functions, and improves upon ([2]) for all | < a < |. Specihcally, 
we obtain the following result. 

Theorem 1 For any balanced Boolean function f{x) : {0,1}" i—)■ {0,1}, and 
any | Y a < ^, we have that 

-)(l-2a)‘. (3) 

For the proof of Theorem [H we hrst lower bound the conditional en¬ 
tropy H{f {X)\Y) in terms of the second and fourth moments of the random 
variable (1 - 2P/), where P/ = Pr (/(X) = 0|X = y). Specihcally, we show 
that 

^ E(1 - 2Plf. (4) 

To upper bound the second and fourth moments in (jlj), we use basic Fourier 
analysis of Boolean functions along with a simple application of the Hyper- 
contractivity Theorem M, in order to derive universal upper bounds on 

^All logarithms are taken to base 2. 


H(f(X)\Y) > 1 - i^E(l - 2Plf + (l - 


/(/m;r)<i^(l-2a)^ + 9(l-M 
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E(1 — 2PyY^ that hold for all balanced Boolean fnnctionsU In particnlar, 
these bonnds show that E(l—2Py)^ < (1—20;)^ and E(l—2^^)"^ < 9(1—20)“^. 
Plngging these bonnds in (jl]) yields the Theorem. 

An appealing featnre of the new npper bonnd, is that the ratio between 
the RHS of (Ej) and 1 — h{a) approaches 1 in the limit of a —)■ For the 
bonnd ([2]), on the other hand, the same ratio does not approach 1. Rather 


lim 


(1 -2q)^ 
if — h{a) 


2 

log(e) 


^ 1.3863. 


2 Preliminaries 

To prove onr results, it will be more convenient to map the additive group 
{0,1}" to the (isomorphic) multiplicative group {—1,1}"^. Specihcally, let X 
be an n-dimensional binary vector uniformly distributed over { — 1,1}” and Y 
be the output of passing each component of X through a binary symmetric 
channel with crossover probability a < 1/2. Thus, for all i G {1, • • • ,n} 
we have Yi = Xi ■ Zi, where is an i.i.d. sequence of binary random 

variables statistically independent of with Pr(Zj = —1) = a and 

Pr(Zj = 1) = 1 — a. Note that Y is also uniformly distributed over { — 1,1}". 

Let / : {—1,1}'^ {“1) 1} be a balanced Boolean function, i.e., 

Pr(/(X) = -l) = l 

Note that this condition is equivalent to Kx{f{X)) = 0. For each y G 
{—1,1}"' define the posterior distribution of f{X) given the observation y 

F/^Pr(/(X) = -l|R = |/), 


and note that 


nj{X)\Y ^y) = l-2P’. (5) 

^We remark that using similar techniques it is possible to obtain an upper bound on 
I{f{X);Y) of a similar form that holds for any g-biased function (i.e., any function for 
which E(/(2f)) = 1 — 2q). However, the obtained bound is not maximized at g = ^, and 
therefore cannot be used to establish ([3|) for all Boolean functions. 
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In what follows, we will extensively use Fourier analysis of real functions 
on the hypercube {—1,1}"^. Let X be a random vector uniformly distributed 
on {—1,1}” and dehne [n] = {1, 2,..., n}. The Fourier-Walsh transform of 
a function / :{ —Ijlj^i-^Ris given by 

/(^) = XI 

S'C[n] i^S 

where 

/(S) = Ex (/(X)J]X.) (7) 

\ i&S / 

is the correlation of / with the “parity” function on the subset S. It is easy 
to verify that the basis { 995 ( 0 ;) = riies^^lscin] orthonormal with respect 
to the inner product < f,g >= ¥>{f{X)g{X)), which implies that for any 
two functions /, : { — 1 , 1 }” i-)- M it holds that 

E(/(X) 9 (X))= 5 ^/(S)j(S). ( 8 ) 

SC[n] 


In particular, Parseval’s identity gives f'^iS) = E(/^(X)). Thus, if / is 
a Boolean function, i.e., / : {—1,1}"' e-)- {—1,1}, we have f'^{S) = 1. 

The following proposition gives three simple properties of the Fourier 
coefficients that will be useful in the sequel. 

Proposition 1 (Basic properties of the Fourier transform) Let/: ^ 

{ —1,1} be a Boolean function. We have that 

(i) f is balanced if and only if fifb) = 0; 

(ii) If\f{S)\>t)then\f{S)\>2--; 

(Hi) Any balanced function f which is not a dictatorship function must sat- 
^sfyEs:\s\>2f\S)>0; 

Proof. 

(i) By dehnition /(0) = E/(X) = 0 for / balanced. 
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(ii) We note that the sum J2xg{-i i}« f Ylies integer, and the 

claim follows immediately. 

(hi) Let / be a balanced Boolean function (/(0) = 0) with |^|>2 PiS) = 
0. Then f{X) = J2i&[n] Since there always exists some x G 

{—1,1}"- for which f{x) = must have that l/({*})l = 

1. On the other hand, by Parseval’s identity we have jUP) = 1- 
Clearly, the last two equations can simultaneously hold if and only 
if there is a unique i for which |/({i})| > 0. Hence, / must be a 
dictatorship. 


For any function / : { — 1,1}"" M with Fourier coefficients {/(S')} and 

p G M"*", the noise operator Tpf : {—1,1}" M is dehned as [7] 

(r,/)(x) A ^/(Sjpisijqi. (9) 

SC[n] i^S 

Recall that in our setting X and Y are the input and output of a binary 
symmetric channel with crossover probability a. Thus we can use the Fourier 
representation of / to write E {f{X)\Y = y) as follows |7] 


E(/(A-)|y = !/) 


EI Y. \'^ = y 

,SC[n] 


i&S 


Y I IfK^i 

5C[n] 


\i&S 


Y /(s)E n» 

5C[n] \i&S J i&S 

Y /(s)n®(^')n»i 

SC[n] ies ies 

Y /(S)(l-2a)l»in». 

SC[n] ies 

{Tl-2af){y), 


( 10 ) 

( 11 ) 
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where we have used the fact that {Zi} is an i.i.d. sequence. Recalling equa¬ 
tion dS]), this also yields 

l-2P/ = (r,_2„/)(!,)= /(S)(l-2a)l‘^in!/.- (12) 

5C[n] ieS 

The following well-known theorem will play an important role in the 
derivation of Theorem [T] 

Theorem 2 (Hypercontractivity Theorem, |[7Hn|) Let 1 < p < g < 

oo. Then for all p < all g : {—1,1}"' M, it holds that 

[mTp 9 ){X)n]" < [E{\g{X)n]K (13) 


3 Proof of Theorem [T] 


In this section we prove Theorem [H which gives a universal upper bound 
on the mutual information /(/(X);T), that holds for any balanced Boolean 
function. The mutual information can be expressed as 


/(/(A'); A) = H{J{X)) - H(J(X)\Y) = 1 - Eyft(pl). 
We note that h(-) admits the following Taylor series 

T — p' 


h 


i-E 


log(e) yfe 


^^2k{2k-iy ’ 

and can be lower bounded by replacing with for all k > t, i.e., 

log(e) ^2k log(e) 




t-i 


—p2*^ - p2* 

^^2k{2k-iy ^ ^^2k{2k-l) 


k 
t-1 


_ ^ log(e) 2k 

^^2k{2k-lf 


t-i 




log(e) 


^ 2k{2k - 1) 


(14) 


(15) 


(16) 
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where we have used the fact that h(0) = 0. Using (flB]) . for any t G M, we 
can further lower bound Eyh(Py) as 


Eyh(Py) = Eyh 


1 - (1 - 2PI.) 




-h-E 


log(e) 


t-i 


^ 2k{2k - 1 ) 


Ey ((1 - 2 P/^ 2 t 


Y) 


(17) 


Thus, any upper bound on the hrst t even moments E[(l — 2Py)^^], k = 
1,... ,t, would directly translate to an upper bound on /(/(X); Y). 

We obtain an upper bound on all even moments of the random variable 
1 —2Py, using a simple trick combined with the Hypercontractivity Theorem. 
Formally, we prove the following lemma. 


Lemma 1 Let k >1 be an integer satisfying (1 — 2a)^2k — 1 < 1. For any 
balanced Boolean function f : { — 1,1}” { — 1,1} we have that 

Ey (^(1 - 2Plf^^ < {2k - 1)^(1 - 2af^. 

For the proof, we will need the following proposition. 


Proposition 2 For any balanced Boolean function f : {—1,1}” {—1,1} 

and any 0 < p < 1 we have that 

Ev ((TJf(Y)) < p\ 

Proof. By the definition of the operator Tp\ 


Ey{(TJ)\Y))=EY I I 5^ 

,5C[n] 




Ey( 5^ E 

'^5iC[n] i&Si S2C[n] j&S2 
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( 18 ) 


= E E /(■s2)p'®='Ey n n 

SlC[n] S2C[n] ViSSi iSS2 

= E E /(■S2)p''"’'la.a 

SiC[n\ 52C[n] 

= E 

SC[n] 

where lsi =52 is the indicator function on the event Si = S 2 . Recalling that 
X] 5 c[n] = i- assumptions that p < 1 and that /(0) = E(/(X)) = 0, 

the “weight” assignment /^(S') that maximizes X] 5 c[n] puts all the 

weight on characters S whose cardinality is 151 = 1. Hence, 

E < pt 

SC[n] 


as desired. ■ 

Proof of Lemma [H Let / : { — 1,1}” 1 —)■ {—1,1} be a Boolean function 
and let g = Ti - 2 af ■ Let fc > 1 be an integer, and let p = ijl/{2k — 1) = 
V(2-1)/(2A)-1). By m , we have 

Ey((l - 2Pp“) = Ey (9“(y)) 

= Ey ((r,(ri/,9))"'' (y)) 

< [Ey((T,/,9)2(y))]'-' (19) 

= [Ey((r,i_,„,95f^/)=(y))]‘, (20) 

where flT^ follows from the Hypercontractivity Theorem taken with p = 
2,q = 2k (as p satishes the premise of the theorem), and fl20|) from the dehni- 
tion of the function g. Invoking Proposition [2] with p = {l — 2a)\/2k — 1 < 1, 
we obtain that for any balanced Boolean function and any k>l 

Ey (^(1 - < {2k - 1)^(1 - 2a)2^ (21) 


as desired. ■ 

The following is an immediate consequence of Lemma [1] and flTT)) . 






Proposition 3 For any balanced Boolean function f : {—1,1}"" ^ 1}; 

any integer t > 1 and any | ^ — have that 

unxy, Y) < - 1)‘(1 - 2a)"‘ 

+ (22) 

Theorem [1] now follows by evaluating (j2^ with t = 2. Note that for 
balanced functions the upper bound (1 — 2a)^, which was the best known 
bound hitherto, is obtained as a special case of Proposition [3] by setting 
t = 1. It is easy to verify that for | < a < ^ the upper bound in Theorem [1] 
is tighter. See Figured] for a comparison between the bounds. 



a 


Figure 1: A comparison between the bound from Theorem dl the conjectured 
bound and best known previous upper bound on /(/(X); Y). Note that the 
bound from Theorem [1] is only valid for balanced functions. 
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Remark 1 /n |2l Appendix B, Remark 6], it is elaimed that Conjeeture [H 
ean he shown to hold in the limit of a —?• |. Theorem [I] demonstrates this 
fact for balanced functions, as for a ^ | the ratio between the RHS of ([3]) 
and the conjectured bound tends to 1. As we discuss below, a slightly stronger 
statement can be shown to hold. 

The next simple corollary of Theorem [1] establishes the optimality of the 
dictatorship function among all balanced functions in the very noisy regime. 
The proof is essentially a consequence of the discreteness of the space of 
Boolean functions. 


Corollary 1 (Dictatorship is optimal for very noisy channels) Letf : 
{0, !}"■ I—)■ {0,1} be any balanced Boolean function. There exists > 0 
such that I{f{X);Y) < 1 — h{a) for all a G |]. In particular, 

dictatorship is the most informative balanced function in the noise interval 
O G [^2 2 ] ' 


Proof. By equation flTTl) applied with f = 2, we have that for any balanced 
Boolean function (and any a), 


I Uixy, Y) < !^Ek ((1 - 2F/)") + 



Lemma [T] (applied with k = 2) implies that for | ~ | 

Ey (^(1 - 2PIY^ < (2 • 2 - 1)2(1 - 2a)^ = 9(1 - 2a)^ (23) 

Furthermore, by equations flT^ and fflSj) . we have that for a in this range 


Ey((l-2FpM= ^f(S)(l-2a)= 


SC[n] 


< 1 E P(S) (1 - 2af + Y. I (1 - 2a)\ (24) 

\s\=i J Visits 


where we have used the fact that /(0) =0 for balanced functions (Propo¬ 
sition [T]). Combining the above three inequalities and using the fact that 
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J2san]f^iS) = 1, yields 


I (f(Xy, Y) < ( 1 - 5^ f (S) (1 - 2af+ 

\ l>5|>2 / 

+ f9(l-!^)+!^5:f(S)Vl-2< (25) 

V ^ ^ |S|>2 / 

Now, suppose that / is a balanced Boolean function which is not a dicta¬ 
torship function. This implies (in fact, equivalent to) X]|s|> 2 /^(‘^) > 0- 
Proposition [H it therefore must be the case that 

\S\>2 


Therefore, for such /, the RHS of fl25|) can be upper bounded by 


log(e) 


(l-4-”)(l-2a)2+ 9 1 


log(e)^ ^ log(e) ^-r 


(l-2a)^ (26) 


It can be directly verihed that for any a G , where | ■ 2 ”, 

the expression in fl26|l is smaller than ~ Sa)^ < 1 — h{a), and thus by 

fl25|l . for such a we have 

I{f{Xy,Y)<l-h{a), (27) 

which completes the proof. ■ 


Remark 2 In an unpublished work, Sushant Sachdeva, Alex Samorodnitsky 
and Ido Shahaf have shown, using different techniques, that dictatorship is 
optimal among all (not just balanced) Boolean functions from {0,1}” to {0,1} 
for all a e |]. 


4 Discussion 

A natural question is: What are the limits of the approach pursued in this 
paper? To this end we note the following two limitations: 
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Firstly, the dictatorship function, which is conjectured to be optimal, 
satishes Ey ^(1 — j = (1 — 20;)^^ for every /c G N. The ratio between 

the bound in Lemma [T]on the fc-th moment of any balanced Boolean function 
and (1 — 20;)^^ grows rapidly with k. For this reason, we only get mileage 
from applying the lemma with k = 1,2 and not for higher moments. 

The second limitation is that Lemma [T] upper bounds each moment sep¬ 
arately, while we are seeking an upper bound on the entire distribution 
(weighted sum) of the moments: Quantifying the tradeoff between higher 
and lower moments seems to be one of the “brick walls” in proving the 
conjecture. For example, the dictatorship function has the largest second 
moment among all balanced functions, but it is not hard to see that the 
majority function, for example, has a much larger (relatively speaking) kth 
moment for very large values of k. To see this, note that for k S> 2”, 

Ey (^(1 - 2Plf^^ > 2"" ■ max |1 - 2P/p^ 

For the majority function, the maximum is attained at ?/ = (1,1,, 1..., 1) for 

which Pjj ^ 2 "■'^( 2 II") anfi consequently max^ |1 —2P^“-^| 1 —2 

For dictatorship, on the other hand, |1 — 2P/| = 1 — 2q; for every y, and 

therefore 

max |1 - ^ > max |1 - 2P“'=*|2^ 

Therefore, one cannot hope to prove that there is a single function that si¬ 
multaneously maximizes all moments; rather, the conjecture postulates that 
there is some tradeoff between these values and the largest mutual informa¬ 
tion is attained by functions that maximize lower moments at the expense 
of higher ones. 
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